DeepSeek v4

mentions 2 type Person feed RSS

// recent coverage 2 mentions

18:16

2026-06-16

blog.firetiger.com

large-language-models

Migrating from Claude to DeepSeek without breaking everything

Firetiger migrated its agent fleet from Claude to DeepSeek, achieving a 62% reduction in inference costs from $606K/yr to $231K/yr. The migration required extensive prompt engineering and evaluation t…

10:16

2026-04-27

ianbarber.blog

large-language-models

Loss Exploded.

Meta's FAIR team documented a series of training failures in 2021 for their OPT-175B model, including repeated loss explosions and learning issues that required extensive hyperparameter tuning and arc…

// co-occurs with top 8 entities

DeepSeek 2 Firetiger 1 Claude 1 Anthropic 1 AWS Bedrock 1 Sonnet 4.6 1 FAIR 1 OPT-175 1